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Description 

METHOD OF VIDEO COMPRESSION THAT 
ACCOMMODATES SCENE CHANGES 

Background of Invention 
[000 1 ] 1. Field of the Invention 

[0002] The present invention relates to digital video, and more 

specifically, to a method of video frame compression. 
[0003] 2. Description of the Prior Art 

[0004] Digital video is a popular means of information communi- 
cation. Computer-based video conferencing and digital 
television are just two examples of applications of this 
technology. As a single digital image can contain hun- 
dreds of thousands of pixels and many sequential image 
frames are required to produce a quality video effect, 
compression schemes are required for efficiency in pro- 
cessing time and storage space. 

[0005] | n general, image compression is effected by determining 
correlations among regions of pixels. When high com- 



pression is required due to limits in data transmission 
speeds for example, either video quality must be sacri- 
ficed or additional compression hardware and software 
must be provided. Where low amounts of compression are 
tolerable, video quality can be maintained at the cost of 
an increase in memory demand. Among this complex ar- 
rangement of trade-offs, any increase in compression that 
does not significantly lower video quality or increase the 
necessary hardware is undoubtedly beneficial. 
[0006] Practically, compression of a video frame can be realized 
by intra-picture encoding in which a block of pixels is 
compressed referencing pixel information outside the 
block but in the same frame, and by inter-picture encod- 
ing in which a block is compressed referencing pixel in- 
formation of other frames. In compression standards, 
such as MPEG-4, it is convenient to define several types of 
frames each having different compression schemes. Com- 
monly, these types of frames are defined as: an entirely 
intra-picture encoded frame (l-frame), a frame having a 
mixture of intra-picture and inter-picture encoded blocks 
(P-frame), and a frame of only inter-picture encode blocks 
(B-frame). The l-frame is the least compressed and has 
the highest picture quality, the B-frame is the most com- 



pressed and sacrifices some picture quality, and the P- 
frame is fairly balanced in between. Such definitions allow 
for controlling the trade-off between compression and 
quality in systems such as MPEG-4. 

[0007] Once frame type definitions are decided upon, a predeter- 
mined sequence, which is typically repeating, of these 
frames is used to encode a video. Please refer to Fig.l il- 
lustrating a sequence 10 of five frames 12-20. Frame 12 
is an l-frame, frames 14, 16, 18 are B-frames, and frame 
20 is a P-frame. The l-frame 12 is encoded first, its 
blocks (i.e. intra block 22) being encoded referencing in- 
frame picture information only. The B-frames 14, 16, 18 
are encoded next. The blocks of the B-frames (i.e. inter 
blocks 24, 26, 28) are encoded using motion vectors ref- 
erencing picture information of the l-frame 12. Finally, 
the P-frame is encoded using a suitable combination of 
intra-picture encoding (i.e. block 30) and inter-picture 
encoding (i.e. block 32). With such an "l-B-B-B-P" frame 
sequence, a reasonable balance of compression and qual- 
ity can be maintained. 

[0008] a major problem with this method occurs at scene 
changes, such as those that frequently occur during 
movies. From a video compression standpoint, a scene 



change can be loosely defined as: when the pictures to be 
encoded change drastically between two neighboring 
frames. If the scene of the video changes at a B-frame, 
the reference of that B-frame (the preceding l-frame) 
likely contains little useful information with which to en- 
code that B-frame (Recall that B-frames consist entirely of 
inter-picture encoded blocks with corresponding motion 
vectors referring back to a reference frame.). For example, 
if the l-frame 12 (Fig.l) contains an image of an automo- 
bile travelling down a highway, and the B-frame 14 is re- 
quired to have a similar image with the automobile dis- 
placed slightly, there is no problem. However, if the scene 
then changes and the B-frame 16 is required to have an 
image of a man eating a salad, poor encoding of the B- 
frame 16 will result as the l-frame 12 holds little useful 
image information with which to perform inter-picture 
encoding. Thus, scene changes occurring in B-frames de- 
grade overall video quality. 
Summary of Invention 

[0009] it is therefore a primary objective of the claimed invention 
to provide a method of video compression capable of 
handling a scene change occurring in an inter-coded 
frame (B-frame) to improve video quality. 



[0010] Briefly summarized, the claimed invention method in- 
cludes providing a predetermined sequence of video 
frames, the predetermined sequence beginning with an I- 
frame and ending with a P-frame. An l-frame is defined as 
having blocks encoded referencing intra-picture informa- 
tion only and a P-frame is defined as having blocks en- 
coded referencing intra-picture or inter-picture informa- 
tion. Further, a B-frame is defined as having blocks en- 
coded referencing inter-picture information only. The 
method further includes encoding blocks of each frame 
according to the order of the frame in the sequence and 
the frame type, determining a number of intra-picture 
coded blocks in an encoded P-frame, and determining 
that a scene change occurs when the number of intra- 
picture coded blocks is greater than a predetermined 
number. The method finally includes redefining the P- 
frame detected as having a scene change as an l-frame, 
redefining all B-frames of the sequence as P-frames, and 
re-encoding redefined frames. 

[0011] According to the claimed invention, method further in- 
cludes detecting for a scene change in the newly redefined 
frames. Such an iterativeapproach allows the exact frame 
in which the scenechange occurs to be determined. 



[0012] | t j S an advantage of the claimed invention that redefining 
frames allows a scene change to be accurately encoded 
thereby improving overall video quality. 

[0013] These and other objectives of the claimed invention will 

no doubt become obvious to those of ordinary skill in the 

art after reading the following detailed description of the 

preferred embodiment that is illustrated in the various 

figures and drawings. 
Brief Description of Drawings 

[0014] pig.l is a schematic diagram of a sequence of video 

frames according to the prior art. 
[0015] pig. 2 is a block diagram of a video system for performing 

the present invention method. 
[0016] pig. 3 is a flowchart of the present invention method. 

[0017] Fig. 4 is a frame diagram relating to Fig. 3. 

[0018] Fig. 5 is a frame diagram of a video encoded according to 

the present invention method. 
Detailed Description 

[0019] Fig. 2 illustrates a video system 40 capable of performing 
the present invention compression method. The video 
system 40 comprises a random-access memory 44 and a 
processor 46 for executing a compression algorithm 48 



according to the present invention. The compression al- 
gorithm 48 may be stored in the RAM 44 or in a similar 
memory dedicated to the processor 46. The video system 
40 receives a raw video signal from a video source 42, 
such as a digital video camera, at the RAM 44 though an 
interface (not shown). The processor 46 then executes the 
compression algorithm 48 on the raw video data stored in 
the RAM 44 and outputs compressed video data to a dis- 
play or storage device 50 via an output interface (not 
shown). In practical applications, the video source 42, the 
RAM 44, processor 46, and the display 50 may be part of 
a digital camera or a personal computer system. Likewise, 
the source 42 may be a computer-based digital camera, 
with the RAM 44 and processor 46 being part of the com- 
puter, the display device 50 being a remote monitor to 
which compressed video is sent across a network. 

[0020] pig. 3 illustrates a flowchart of the present invention 

method. The process 100 described by the flowchart can 
be practically realized as the compression algorithm 48 of 
Fig. 3 and is described as follows: 

[0021] step 102: Select a frame sequence of I-, B-, and P-frames 
(for example, "l-B-B-B-P"); 

[0022] step 104: Select the first frame to be encoded (i.e. the I- 



frame); 

[0023] step 106: Encode the frame according to its type. Blocks 
of an l-frame are intra-picture encoded referencing only 
picture information of the current frame, blocks of a En- 
frame are inter-picture encoded referencing picture infor- 
mation of the previous l-frame, and blocks of a P-frame 
are intra- or inter-picture encoded depending on the rel- 
ative suitability of each; 

[0024] step 108: Check the type of the current frame being en- 
coded. This can be performed before, after, or during en- 
coding. If the current frame is a P-frame perform step 
114, otherwise continue with step 110; 

[0025] step 1 10:lf the current frame is the last frame to be en- 
coded, end the process. Otherwise, perform step 112; 

[0026] step 112:Select the next frame. Note that the frame se- 
quence is repeated such that another l-frame comes after 
the P-frame in the exemplary sequence; 

[0027] step 114: Is a scene change detected in the P-frame? That 
is, does the number of intra-picture encoded blocks ex- 
ceed a predetermined limit? If a scene change is detected, 
perform step 116. Otherwise, continue with step 110; 

[0028] step 116: Change the current frame from a P-frame to an 
l-frame, and change B-frames in the current frame se- 



quence to P-frames; 
[0029] step 118: Select the first of the newly converted P-frames. 

[0030] Regarding the P-frame encoding of step 106, determining 
whether a block is to be intra-picture encoded or inter- 
picture encoded is well known in the art. For example, a 
P-frame encoding algorithm may encode all blocks as in- 
ter-picture by default, unless such encoding results in a 
quality indicator value that is less than acceptable. The al- 
gorithm would select blocks with unacceptable inter- 
picture encoding quality to be encoded as intra-picture 
instead. 

[0031] Regarding the scene change detection in step 114, this 
can be performed in several ways. In the preferred em- 
bodiment, a running total of intra-picture encoded blocks 
is kept while encoding a P-frame. If the running total ex- 
ceeds a certain limit, then a scene change is said to be 
detected. For example, in a 640x480 pixel video having 
16x16 pixel non-overlapping blocks (1200 blocks total), 
the limit of intra-picture encoded blocks can be selected 
as 800. Thus, if an 801st block is to be intra-picture en- 
coded, a scene change is determined. Of course, another 
limiting value could be selected for a different tolerance to 
scene change detection. Note that, if a scene change is 



detected in the P-frame, the scene change may actually 
occur in the P-frame itself or in any preceding B-frame. 
The exact frame in which the scene change occurs is de- 
termined by the iterative nature of the process 100. 
[0032] please refer to Fig. 4 illustrating frame sequences as an 
example of the effect of the process 100. In Fig. 4, the 
frame sequence (frames 202-210 corresponding to an "I- 
B-B-B-P" sequence) is initially selected in step 102 of the 
process 100. Encoding begins with this sequence accord- 
ing to steps 104 and 106. Then, during step 114, the P- 
frame 210 is detected to have a scene change. Subse- 
quently, according to step 116, the frame sequence is re- 
vised to "l-P-P-P-l". That is, the format of the frames 204, 
206, 208 is changed from B-frame to P-frame and the 
format of the frame 210 is changed from P-frame to I- 
frame. The first P-frame of the new sequence, P-frame 
204, is selected in step 118 and encoding thus continues. 
Many of the blocks of a P-frame that was formerly a B- 
frame will be re-encoded as intra-picture blocks, how- 
ever, re-encoding blocks that are to remain inter-picture 
encoded is unnecessary. During continued encoding, step 
114 again detects the scene change based on the number 
of intra-picture blocks generated, however, this time it 



occurs in the second P-frame 206. Accordingly, the P- 
frame 206 is re-encoded as an l-frame. Since at this point 
no B-frames exist to be converted to P-frames, encoding 
continues until complete. 
[0033] pig. 5 illustrates a video 300 encoded according to the 

present invention method. The video 300 is made up of a 
series of frame sequences that are initially defined as "I- 
B-B-B-P". However, during compression, the process 100 
detects scene changes and changes frame types accord- 
ingly. The video 300 includes a frame sequence 302 hav- 
ing no scene change (which is typical), a frame sequence 
304 having a scene change in the third frame (as in Fig. 4), 
a frame sequence 306 having a scene change in the first 
frame, and a frame sequence 308 having a scene change 
detected in the fourth frame. Note that the present inven- 
tion method 100 handles the scene change of the frame 
sequence 304 occurring in the l-frame, even though no 
frame redefinitions are necessary. As a result, the exact 
frames in which scene changes occur are determined, 
these frames being encoded to preserve such scene 
changes. 

[0034] | n practical application, the present invention can be im- 
plemented as part of an MPEG-4 (moving picture experts 



group) video compression program or device. Implemen- 
tation of the above-described method 100 into an MPEG- 
4 system is straightforward to one of ordinary skill in the 
art. 

[0035] | n contrast to the prior art, the present invention redefines 
frames to accurately encode a scene change. Specifically, 
when a scene change is detected in a P-frame, the P- 
frame is converted to an l-frame and any B-frames in the 
same sequence are converted to P-frames. As a result, the 
exact frame in which the scene change occurs can be de- 
termined, and the scene change can be properly encoded. 

[0036] Those skilled in the art will readily observe that numerous 
modifications and alterations of the device may be made 
while retaining the teachings of the invention. Accord- 
ingly, the above disclosure should be construed as limited 
only by the metes and bounds of the appended claims. 



